SPECIFICATION

learning by means of sound identification"

The path to learning a foreign language is full of obstacles for the adult learner.
Learners usually reach a point at which the language they are learning fossilizes, ending up
in a state that is very distant from the target language they were trying to learn.

The most difficult aspect is pronunciation, i.e. phonetics and phonology. It is
extremely unusual for an adult to attain native-like pronunciation. Compared to syntax,
there are very few reports on how phonetics and phonology are learned, and there are also
very few proposals for how to solve this problem.

One view of the difficulty in learning phonetics and phonology is that the problem is
the lack of training of the vocal tract. The usual pedagogical recommendation associated
with this approach is to utter oral fragments of the target language in a repetitive fashion.
Sometimes, diagrams are shown that explain to the learner how he/she must use the muscles of
his/her vocal tract.

Some pedagogical products try to assist the learner in developing his/her pronunciation
by performing an automated evaluation of the waveform of the sound that he/she generates. In
these cases, the learner can use this mechanism to evaluate the quality of his/her pronunciation
[Auralog].

Another approach to this problem is the one by Alfred Tomatis [Tomatis]. Tomatis
suggested, as a result of his scientific research, that the problem is actually located in the ear.
He proposes a pedagogical system based on training the ear by inputting sound samples in
which some frequencies have been intensified and others have been attenuated. The idea is
that this action will reduce the atrophy of those areas, and that the subject will be able to
better perceive the sounds of language, and therefore will also be able to better produce them.



Unfortunately, none of these approaches is the definitive solution for learning
pronunciation in foreign languages. The current invention uses the existing scientific
knowledge about language learning to identify the area where the main problem lies, and
proposes an approach to solve it.



EXPLANATION OF THE INVENTION

PROBLEM ANALYSIS

Difficulties in learning pronunciation

Learning pronunciation is based on two aspects: 

• Learning phonetics, i.e. learning the features that characterize the individual sounds that
make up speakers' utterances. These features are basically associated with the profile of the
frequency power spectrum of the sound. In order for the learner to learn these features,
he/she must first identify them and then remember them.
• Learning phonology, i.e. learning how the sounds of a word or of a sentence influence each
other and how they relate among themselves, and also how the temporal evolution of certain
features, such as frequency or intensity, contributes to conveying the structure and the meaning
of language. Because phonology is based on the features of sounds, learning phonology is based
on learning phonetics.

Very young infants, just a few weeks or months old, can identify the features of the
sounds of all languages. However, this ability is gradually lost, although it has been observed
that there always remains a latent capacity that can be trained.

The consequence of this difficulty in identifying sounds is that it is not possible to
remember them: how is it possible to remember something that cannot be identified?
Simultaneously, the inability to learn phonetics causes a concomitant inability to learn
phonology. That is to say, if individual sounds cannot be learned, how could it be possible to
learn the relationships among those sounds?

As a result of these difficulties in learning phonetics and phonology, the learner faces
three fundamental problems:

- The learner experiences serious problems learning words, because learning a word is
learning a sequence of sounds, and those sounds are difficult to learn.
- Because the learner does not learn phonology, he/she also finds serious difficulties in
discriminating words in an automated and unconscious way. Much scientific work has
been done on the process that the brain uses to identify words. An important part of this
process relies on cues provided by phonology. For example, the sequence of Spanish sounds
"lavacalentita" can correspond to two different word groups: "la vaca lentita" or "lava
calentita". Phonology allows the listener to discriminate the right words in this case [Quilis
et al 1975]. Therefore, when the learner has problems with phonology, he/she also has
problems in correctly discriminating words.
- Because the learner does not learn phonology, he/she also loses the ability to identify the
syntactic structure of the messages that he/she listens to. Science has shown that the
information about such structure is conveyed by the changes in frequency, intensity and
duration of the sounds of language.






In order to overcome the previous difficulties, it is customary to resort to written
texts. For the learner, it is much easier to remember the written form of a word than the oral
form, because in most cases written characters can be easily identified and memorized. Besides,
the learner has enough time to analyze the words that exist in a given sentence, and elucidate
the structure of the sentence.

The result is that, in general, foreign language learners come into contact with written
language very early in the learning process. It is difficult to find a language course that does not
use a textbook. The consequence of this is that a large part of the words that are learned are
learned from their written forms.

The present invention is based on the idea that using written text too early in the
process of language learning creates difficulties for learning phonetics and phonology. Building
on this idea, the present invention provides a way of learning and training a language that
allows the learner to discriminate words and identify the structure of sentences, but without
suffering the negative effects caused by prematurely using written texts.


Analysis of the relation between language's visual form and phonologic form. 

In general, modern science believes that there are a number of rules or relationships
between the visual form and the phonologic form of words, and that these relationships
influence the way in which the speaker can use words.

These rules or relationships also include some aspects of the characters that are
used in the language. That is to say, if a person has become accustomed to associating a certain
sound with the letter "b", for example, he/she will have a tendency to associate that sound with
those words that contain that letter, even in a foreign language, so that learning words from their
visual form will be influenced by it.

The Stroop effect [Possner et al 1999] provides information related to this. In order to
study the Stroop effect, subjects are shown different words that designate colors and which are
written using different colors. The task that subjects have to perform is to name the colors in
which the words are written. This task appears easy; however, it is very difficult to perform
because reading the word activates a mental idea of a color which is different from the color in
which the word is written, which creates a conflict.

The parallelism between the Stroop effect and learning words in a foreign language is
that when a person reads a word whose phonological form has been heard before, there will be
a conflict between the phonological form that is remembered and the phonological form that is
activated by reading the word, because the activated phonological form is affected by the
conversion rules of the native language.

Another important issue in this matter is that the perception of language sounds is
categorical. That is to say, when the speaker perceives a sound that is somewhere between the
sounds "p" and "b", he/she usually interprets the sound as either a "p" or a "b", rather than
interpreting it as a vague sound that is somehow similar to both of them. Then, if the sounds of
the native language are introduced into the learning process, even if only by association with
the written characters, it is reasonable to believe that these native sounds will create difficulties
in perceiving the foreign sounds, because the categorical perception mechanism will tend to
assimilate the foreign sounds to the native sounds.

In this respect, Borden et al say that "Even the sounds of speech are heard within the
framework of our particular language, so that if we hear a less familiar language being spoken,
we try to fit the less familiar sounds into the categories of speech sounds we have in our own
language" [Borden et al (1994), p. 174].

Even in the case of two phonetic languages in which the same transcription rules
exist and there are no exceptions, the associations between characters and sounds can be
different. For example, the stop sounds "p" and "t" have different characteristics in English and
in Spanish. In these circumstances, when the learner sees a letter "p", he/she will associate it
with the sound "p" of his/her native language, instead of with the sound "p" of the foreign language.

On the other hand, if a person who is learning Spanish learns, for example, the sound
[b] from the letter "b", besides suffering interference from other sound images that he/she might
previously have for the letter "b", he/she will not perceive the fact that its associated phoneme
is realized in different forms in Spanish, i.e. it has different allophones. For example, the sound
of "b" in the Spanish word "becerro" is different from the sound of "b" in the Spanish word
"haber".

In general, learning a word is learning a sequence of sounds. If a person learns
words by learning a sequence of graphical characters, this learning will condition how the
sequence of sounds that really corresponds to the word is used.

The work of Ohala reinforces the idea that there is a strong link between visual form
and sound form: "there is a growing body of evidence that much of what is regarded as native
speakers' knowledge of the phonology of their language is very much influenced by, if not
based on, their knowledge of how their language is spelled" [Ohala 1995, p. 716].

ESSENCE OF THE INVENTION
Introduction 

As a result of the previous analysis, the invention holds that the main problem of
learners of pronunciation is the interference that exists between visual form and
phonological form. In order to solve this problem, the present invention proposes a solution that
allows the learner to enjoy the advantages of written language without suffering its disadvantages.

The invention uses graphical entities that replace the written form of the target
language. These graphical entities might be, for example, a line, a sequence of characters,
a waveform, or another type of entity that has some linear characteristic. The invention creates a
correspondence between the fragments of such graphical entities and the language fragments on
which the learner is working, so that the learner can indirectly access the content of the
samples of target language that he/she is examining.

In order to facilitate the exposition of the invention, a few definitions are provided
in the next lines. In general, in this document the terms "learner" or "user" will be used to
refer to the person who is using the invention with the purpose of developing his/her
command of a target language.

The invention is used to work on a set of samples of a target language, each one of
which is called in this invention an ORIGINAL EXTRACT. Such original extracts can be obtained
from contents in the target language which are larger than those on which the learner is
working, or could have been created for the instruction of the target language.

For each original extract, the invention will use one of the graphical entities that were
previously mentioned, which will be called BLIND EXTRACTS. Each blind extract
corresponds to an original extract, and is created specifically for it. As was mentioned, such a
blind extract has a linear structure, and there is a certain correspondence between the fragments
of the original extracts and the fragments of the blind extracts. The blind extract is used to
allow the learner to access the different linguistic components of the original extracts, such
as sounds, syllables, words, phrases and so on.

For example, Exhibit 1 shows a blind extract that has been built following the criteria
of the preferred embodiment (which is described later on in this document) for the following
example original extract: "The man at the bar is eating a sandwich". As can be observed, each
syllable of each word has been replaced by a character "X". Also, the spaces that separate
words have been maintained.

Exhibit 1.

Example original extract

The man at the bar is eating a sandwich

Blind extract

X X X X X X XX X XX

Original extract divided in syllables

The man at the bar is ea-ting a sand-wich


The invention also makes it easier to identify the structure of the messages that are
received. Several techniques can be used with that goal, including those techniques described in
the patent application [Palacios 2003]. For example, Exhibit 2 shows the escalator tree of an
example blind segment, in which the phrases that have been chosen are only those phrases that
correspond to subordinate clauses.



Exhibit 2.

Separated discrete blind extract, escalator tree.

X X

X X XX

X X

X X X XX

Original extract

The man who came after the match finished is my father.

Escalator tree of the original extract

The man

is my father

who came

after the match finished

As will be explained later, in the preferred embodiment the invention would be
implemented in a computerized system that would allow the user to select fragments of the blind
extract, and therefore access the fragments of the original extract to which they correspond
and, for example, listen to their aural playback.

Optionally, there could also be other means of interaction, for example with the
goal of marking certain fragments of the blind extract with some special graphical means, or of
obtaining other types of information.

As can be seen, the blind extract can be used to distinguish the different words that exist
in the aural production that is being listened to, and to work on them, for example with the
purpose of clarifying their meaning, without letting the learning of sounds be conditioned by the
characters of written text.

The invention can be used in isolation or simultaneously with
other systems or methods that are oriented to the comprehension and/or the learning of
languages. For example, it can be used with a system that is oriented to the comprehension of
samples of a target language in which the learner has a special informative interest, as could be
the case with documentaries, movies or other types of content.

Even though most of the explanation that follows is centered on the learning of
foreign languages, the invention can also be used for training a native language in individuals
who might have some type of problem. This would be especially appropriate for persons to
whom it is not advisable to show the written form of language, for any reason. This might
happen, for example, in the case of individuals who cannot read or write. It can also be
appropriate for dyslexic individuals or individuals who suffer from some type of aphasia.

ADVANTAGES OF THE INVENTION

As was mentioned before, the invention basically allows the learner to discriminate words and
identify the structure of sentences without suffering the negative effects of prematurely
working with written texts. In the next lines, some aspects of the difficulties related to word
discrimination are explained in order to better evaluate the contribution of the invention.

Word discrimination is an extremely difficult task that the mind of a normal native
speaker performs surprisingly easily for the native language. Actually, science does not yet
understand very well how this task is performed [Anderson et al (2002)], [Kager, (1995)].

In the case of language learning in infants, Boysson-Bardies explains that "Of all the
enigmas that have arisen in trying to understand how the child acquires the words of a
language, the segmentation of speech during the first year has remained the most disconcerting;
even today, the way in which the child manages to distinguish and extract words from a
continuous sound wave retains a certain mystery" [Boysson-Bardies (2001), p. 95].

Logically, a similar problem occurs in the case of learners of foreign languages. In
this case, the learner has the apparent advantage that he/she unconsciously knows what he/she
is looking for; but he/she also has other disadvantages, one of which is the immediate need to
identify words to communicate with other people.

Children spend around two years until they utter their first words in their native
language, while an adult or an older child learning a second language would not want to wait so
long. The invention allows learners to quickly learn the words of the target language without
impairing the learning of phonetics and phonology.

Another important advantage of the invention has to do with the learning of languages
in which the writing system is very complex, possibly not even based on phonemes, such as
Chinese. In these cases, learners have many problems using written language as support, but
they can use the present invention instead.

In summary, the main advantages of the invention are:
1. to facilitate the learning of the sounds of the target language, which will produce better
listening comprehension, better pronunciation, a better capacity for memorizing and
learning words, and indirectly a better integral learning of the language.
2. to allow the creation of associations between the phonological form and the visual form which
are similar to the associations that native speakers create, so that reading comprehension
will also be enhanced.



DESCRIPTION OF DRAWINGS

EXPOSITION OF ONE EMBODIMENT

DESCRIPTION OF THE PREFERRED EMBODIMENT
General Description

It is understood that the purpose of this section is only to present a preferred 
embodiment, and that it does not have limitative effects on the invention. 

In the preferred embodiment, the invention is implemented with a computerized system,
which can be based, for example, on a personal computer such as the Dell® Dimension XPS®,
adding a mouse and a keyboard so that the user can interact with the system. The computerized
system runs an operating system that can be, for example, Microsoft® Windows 2000®.

The computerized system contains a database in which a plurality of original extracts
are stored. Such original extracts can come from a larger sample of language which is
conveniently divided into original extracts, or they can be independent original extracts which
have been chosen for any reason.




The computerized system also contains a computer program that manages and
presents the different blind extracts, manages user interaction, generates aural playbacks and
highlights the fragments of the blind extracts that are chosen. The computer program can be
developed, for example, with a development environment such as Microsoft® Visual Basic 6.0®.

The computerized system also contains a database of examples, and those
examples are appropriately indexed so that the user can select a given fragment of a blind
extract and the system has means to show a language sample which is similar to
the selected fragment. For example, if the user selects a fragment that corresponds to the word
"house", the system might show examples such as "the house is big", "my house is in New
York", or other examples. If the user selects the syllable "pu", the system might show examples
such as "reputation" or "computer".
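The indexed example database described above could be sketched as follows. This is only an illustration in Python, not the implementation of the preferred embodiment: the data layout, the hand-written syllable divisions and the names `EXAMPLES`, `build_index` and `examples_for` are assumptions introduced here.

```python
from collections import defaultdict

# Hypothetical example store: each sample is listed with its words, and each
# word with a hand-written syllable division (an assumption, not source data).
EXAMPLES = {
    "the house is big": [["the"], ["house"], ["is"], ["big"]],
    "my house is in New York": [["my"], ["house"], ["is"], ["in"], ["new"], ["york"]],
    "reputation": [["re", "pu", "ta", "tion"]],
    "computer": [["com", "pu", "ter"]],
}


def build_index(examples):
    """Index every sample under each word and each syllable it contains."""
    by_word = defaultdict(list)
    by_syllable = defaultdict(list)
    for text, words in examples.items():
        for syllables in words:
            word = "".join(syllables)
            if text not in by_word[word]:
                by_word[word].append(text)
            for syllable in syllables:
                if text not in by_syllable[syllable]:
                    by_syllable[syllable].append(text)
    return by_word, by_syllable


BY_WORD, BY_SYLLABLE = build_index(EXAMPLES)


def examples_for(fragment):
    """Return samples for a selected word, falling back to a syllable lookup."""
    key = fragment.lower()
    return BY_WORD.get(key) or BY_SYLLABLE.get(key, [])
```

With this layout, `examples_for("house")` returns the two sentences that contain the word, and `examples_for("pu")` returns the two words that contain that syllable.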

Types of blind extracts 

In the preferred embodiment several types of blind extracts, with different
characteristics, are used with the purpose of providing additional information to the learner
about the samples of the target language on which he/she is working.

In total, in the preferred embodiment there exist eight types of blind extracts, which are 
described in Exhibit 3. 

Exhibit 3.

1. separated discrete syllabic structured blind extract,
2. separated continuous syllabic structured blind extract,
3. linked discrete syllabic structured blind extract,
4. linked continuous syllabic structured blind extract,
5. separated discrete syllabic non structured blind extract,
6. separated continuous syllabic non structured blind extract,
7. linked discrete syllabic non structured blind extract,
8. linked continuous syllabic non structured blind extract.

As can be observed, the eight extracts vary in three features, which are explained in
this section. Each feature can have two possible values, as explained below:

- Feature 1: "separated/linked". Separated extracts show the spaces and punctuation marks
that exist between words, while the linked extracts do not show them.
- Feature 2: "discrete/continuous". In the discrete extracts, the syllabic structure corresponds
to the syllables that exist in the words when the words are uttered independently; in the
continuous extracts, the syllabic structure corresponds to the syllables that exist when the
words are pronounced one after the other in the extract in a continuous way.
- Feature 3: "structured/non structured". In the structured extracts there exists an internal
codification that reflects the phrase structure of the original extract to which it corresponds;
in the non structured extracts such internal structure does not exist.

In order to facilitate the exposition, the different types of extracts will be referred to
as follows. When a blind extract is mentioned without specifying the value of any of
the three possible features, it will be understood that reference is being made to extracts that
can have either value, or to the blind extract or extracts that can be deduced from the context.
For example, mentioning "blind extract" will mean that reference is being made to any of the
eight types of extracts. Mentioning "blind extract" and specifying a feature, such as "discrete
blind extract", will refer to any of the four extract types that have the feature "discrete", i.e. the
separated discrete syllabic structured blind extract, the linked discrete syllabic structured blind
extract, the separated discrete syllabic non structured blind extract, and the linked discrete
syllabic non structured blind extract. As was mentioned, when the value of any of the features
is not specified, the context will also be used to clarify which type of blind extract is being
referred to.
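The eight types of Exhibit 3 and the naming convention just described can be illustrated with a short Python sketch. The names `extract_types`, `type_name` and `matching` are hypothetical and are used here only for illustration.

```python
from itertools import product

# The three binary features, ordered so that the product below reproduces
# the order of Exhibit 3 (structure varies slowest, syllabification fastest).
STRUCTURE = ("structured", "non structured")
LINKING = ("separated", "linked")
SYLLABIFICATION = ("discrete", "continuous")


def extract_types():
    """List the eight blind extract types in the order of Exhibit 3."""
    return [
        {"linking": link, "syllabification": syll, "structure": struct}
        for struct, link, syll in product(STRUCTURE, LINKING, SYLLABIFICATION)
    ]


def type_name(t):
    """Build the full name of a type, e.g. 'linked discrete syllabic ...'."""
    return f'{t["linking"]} {t["syllabification"]} syllabic {t["structure"]} blind extract'


def matching(**features):
    """Names of all types compatible with the feature values that are given."""
    return [
        type_name(t)
        for t in extract_types()
        if all(t[key] == value for key, value in features.items())
    ]
```

Calling `matching()` with no feature yields all eight types, while `matching(syllabification="discrete")` yields the four types that the expression "discrete blind extract" refers to.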

The feature that has the values "structured/non structured" is different from the other
features. The reason for having this feature is that the extracts can be internally structured in
phrases. The structured extracts convey information about the phrases of the language sample
that is being worked on. Their purpose is to generate the non structured extracts and to convey
information about the phrases that exist.

In the explanation that follows it is understood that reference is made to the non
structured extracts, because they are the ones which are shown to the user, but in the preferred
embodiment there will exist a structured extract for each non structured extract.

In the preferred embodiment the blind extracts are generated by linking "X" characters.
These characters are grouped in CHAINS. Each chain of the blind extract is a group of characters
that corresponds biunivocally (one-to-one) with a word in the original extract.

In what follows, the different blind extracts that can exist in the invention are
explained.

Separated discrete syllabic blind extract. 

In the separated discrete syllabic blind extract the chains are visually separated thanks to
the use of the punctuation marks that exist in the original extract, such as spaces, full stops,
etc. The blind extract shown in Exhibit 1 is a separated discrete blind extract. The
characteristics of this blind extract are described in what follows.

1. each of the characters that make up the chains corresponds to one syllable of the real word
that is related to that chain,
2. words and chains are related in a biunivocal relation, so that there exist as many chains as
there exist real words in the original extract,
3. the chains are separated by the same spaces or orthographic characters that separate the
words that are related to those chains.
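The one-to-one relation between chains and words is what lets a selection on the blind extract reach the corresponding word. A minimal Python sketch follows, assuming the original extract is already divided in syllables with "-" inside words and single spaces between words, as in Exhibit 1; the function names `chain_spans` and `word_at` are hypothetical.

```python
def chain_spans(syllabified_extract):
    """Build the blind extract and record which word each chain stands for.

    Returns (blind_extract, spans), where every span is a tuple
    (start, end, word) giving the character range of one chain.
    """
    blind_parts, spans, pos = [], [], 0
    for token in syllabified_extract.split(" "):
        syllables = token.split("-")
        chain = "X" * len(syllables)          # one "X" per syllable
        spans.append((pos, pos + len(chain), "".join(syllables)))
        blind_parts.append(chain)
        pos += len(chain) + 1                 # skip the separating space
    return " ".join(blind_parts), spans


def word_at(spans, offset):
    """Return the word whose chain covers the selected character offset."""
    for start, end, word in spans:
        if start <= offset < end:
            return word
    return None  # the offset falls on a separator, not on a chain
```

For the extract of Exhibit 1, selecting either character of the seventh chain would give access to the word "eating", and selecting a space gives access to no word.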

Separated continuous syllabic blind extract

This extract takes into account the fact that when several words are uttered in a
continuous way one after the other, the pronunciation of certain syllables is modified with
respect to how they would be pronounced in the isolated words. This happens because of
coarticulation phenomena and because of the prosodic rules of language. For example, the
continuous production of the original extract shown in Exhibit 1 would contain the syllables
shown in Exhibit 4.

Exhibit 4.

The-ma-nat-the-ba-ris-e-atin-ga-sand-wich


As can be seen, some syllables have been modified. In the continuous syllabic blind
extract graphical means are used to link the syllables that are linked in the continuous
pronunciation of the original extract. The characteristics of this extract in the preferred
embodiment are the following.

1. using a hyphen ( - ) for those syllables that belong to different words and get linked, so
that the hyphen is located between the chains whose syllables are linked,
2. using an arrow ( → ) for those syllables that exchange a fragment. The arrow
would be located between the syllables that intervene in that exchange, and it would be
oriented from the syllable that loses a fragment to the syllable that gains it,
3. using strikethrough font format for the syllables that disappear,
4. using red font format for the syllables that get transformed in any other way.

Exhibit 5 shows what the continuous blind extract would be for the original extract
shown in Exhibit 1, using the criteria of the preferred embodiment.

Exhibit 5

Separated continuous syllabic blind extract
X X X XX^ X XX

Example original extract

The man at the bar is eating a sandwich

Original extract divided in syllables in a continuous fashion
The-ma-nat-the-ba-ris-e-atin-ga-sand-wich


Linked discrete syllabic blind extract and linked continuous syllabic blind extract 

The continuous blind extract is used to show the fact that speech does not separate 
words with pauses. Even though in written language words are clearly separated by spaces or 
orthographic signs, this does not happen in speech. In speech the identification of the different 
words is done thanks to prosody, and sometimes thanks to some distinctive characteristics in 
how words are built. 

Linked blind extracts aid the learner in discriminating the different words that make
up speech. They are characterized by having no separations to indicate words. That
is to say, the syllabic linked blind extracts are similar to the syllabic separated blind extracts,
but they do not have separating characters between chains; the only thing that exists now is a
consecutive collection of characters that represents the sounds of the extract. However, there
still exists a correspondence between the syllables of the original extract and the characters of
the linked blind extract that can be used to emphasize certain linguistic components.

There are two types of linked blind extract, as there were two types of separated blind 
extracts. The first one is the LINKED DISCRETE BLIND EXTRACT, which shows the 
syllables of the individual words of the original extract. The second one is the LINKED 
CONTINUOUS BLIND EXTRACT, which shows the syllables of the original extract when 
such original extract is pronounced in a continuous fashion. 

Phrase structure of the blind extracts

As was mentioned before, the invention offers the possibility of showing the
phrase structure of the original extract through the blind extract. The structured blind extract is
used to do that; the appropriate phrases would have been defined in it previously.

Generation of aural playbacks

The computerized system comprises a speech technology system that converts
text into speech with a plurality of characteristics.

Reference database 

The computerized system contains a speech database which contains phonetic and
phonologic information about the different words that are used in the extracts, and which
covers both the case of the isolated word and the case in which the word is surrounded by other
words. The database contains information about:

1. the syllabic structure of the words; for example, a word such as "house" would contain
information such as "ho-use" which indicates that the word has two syllables;
2. the transformation rules for syllables for the continuous case, depending on the context
where they are located; for example, a word such as "is" would contain information that
would indicate that when it is followed by a word such as "special", the word "is" is linked
to the "s" in "special";
3. the effects that the syntactic structure of extracts has on the phonological behavior of the
segment sounds; for example, in the Spanish sentence "quiero reparar el techo de la casa
antes de comer", the last syllable of the word "casa" does not link to the first syllable of the
word "antes", because they are in different levels of the phrase structure of the sentence.
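The three kinds of information listed above could be organized, for instance, as in the following Python sketch. The record layout, the wording of the linking notes and the names `WordEntry`, `SPEECH_DB`, `syllable_count` and `linking_rule` are assumptions made for illustration; the specification does not define a concrete schema.

```python
from dataclasses import dataclass, field


@dataclass
class WordEntry:
    # Syllabic structure of the isolated word (item 1 of the list).
    syllables: list
    # Context-dependent linking notes keyed by the following word (item 2).
    linking: dict = field(default_factory=dict)


# Hypothetical entries; the syllable divisions follow Exhibit 1 and the
# linking notes paraphrase the examples given in the text.
SPEECH_DB = {
    "sandwich": WordEntry(["sand", "wich"]),
    "eating": WordEntry(["ea", "ting"]),
    "is": WordEntry(["is"], {"special": "final 's' links to the initial 's'"}),
    "casa": WordEntry(["ca", "sa"], {"antes": "last syllable links to the first syllable"}),
}


def syllable_count(word):
    """Number of syllables of the isolated word."""
    return len(SPEECH_DB[word].syllables)


def linking_rule(word, next_word, same_phrase=True):
    """Return the linking note for a word pair, or None when no link applies.

    Item 3 of the list is modelled by `same_phrase`: words located in
    different levels of the phrase structure are never linked.
    """
    if not same_phrase:
        return None
    return SPEECH_DB[word].linking.get(next_word)
```

Under these assumptions, `linking_rule("casa", "antes", same_phrase=False)` returns `None`, mirroring the Spanish example in which the phrase boundary prevents the link.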

This database is the same database that is used in the speech technology system, but it 
can also be accessed to generate the blind extracts. 

Composition of the blind extracts

In the preferred embodiment, the blind extracts are built by aggregating "X"
characters. Each one of the "X" characters represents a syllable, except in the case in which
some syllables link together or are transformed in some way in the continuous extract. In this
case, a syllable of the continuous extract can correspond to two "X" characters linked with
graphical means, as is explained below in this document.

Generation of the blind extracts 

The separated discrete syllabic blind extract (SDSBE) is generated from the original
extract by replacing each word with a group of "X" characters, so that each "X" character
corresponds to a syllable of the word. For this task, the information that exists in the speech
database about the syllabic structure of words is used.

The separated continuous syllabic blind extract (SCSBE) is generated from the
SDSBE by applying to it the syllable transformation rules that exist in the speech database.

The linked discrete syllabic blind extract (LDSBE) is generated from the SDSBE by
removing the orthographic characters that separate the chains.

The linked continuous syllabic blind extract (LCSBE) is generated using the SCSBE, 
removing the orthographic characters that separate the chains, but respecting the graphical 
means that mark the syllables that have been transformed. 
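
The four generation steps above can be sketched as follows. This is an illustration under stated assumptions, not the implementation of the preferred embodiment: the "^" character is an assumed stand-in for the graphical means that mark linked syllables, the link table is read off the syllable division shown in Exhibit 6, the syllable counts are toy values, spaces are the only separators, and the syntactic effects stored in the database are ignored. All function and variable names are hypothetical.

```python
LINK = "^"  # assumed stand-in for the graphical linking mark

# Toy syllable counts for the example extract (assumption).
SYLLABLES = {"the": 1, "man": 1, "at": 1, "bar": 1, "is": 1,
             "eating": 2, "a": 1, "sandwich": 2}

# Word pairs whose boundary syllables link in continuous speech,
# following the division "ma-nat", "ba-ris", "atin-ga" of Exhibit 6.
LINKS = {("man", "at"), ("bar", "is"), ("eating", "a")}


def separated_discrete(words):
    """SDSBE chains: one "X" per syllable of each word."""
    return ["X" * SYLLABLES[w.lower()] for w in words]


def separated_continuous(words, chains):
    """SCSBE chains: SDSBE chains with a mark where a word links onward."""
    out = list(chains)
    for i in range(len(words) - 1):
        if (words[i].lower(), words[i + 1].lower()) in LINKS:
            out[i] += LINK
    return out


def linked(chains):
    """LDSBE or LCSBE: drop the separators, keep any linking marks."""
    return "".join(chains)


words = "The man at the bar is eating a sandwich".split()
sd = separated_discrete(words)
sc = separated_continuous(words, sd)

sdsbe = " ".join(sd)   # "X X X X X X XX X XX"
scsbe = " ".join(sc)   # "X X^ X X X^ X XX^ X XX"
ldsbe = linked(sd)     # "XXXXXXXXXXX"
lcsbe = linked(sc)     # "XX^XXX^XXX^XXX"
```

Deriving the linked forms from the separated ones keeps the four extracts consistent with each other, since each is produced from the same list of chains.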


System execution 

The computerized system comprises functionality to aurally reproduce the original
extract, or a fragment of the original extract, in a plurality of ways that will be indicated below.
For example, the reproduction can be done aurally emphasizing the words or syllables of
certain phrases.

The computerized system has functionality to graphically emphasize parts of the blind
extract, in a plurality of ways, either independently of or simultaneously with the
reproduction of the original extract. With this functionality, when the original extract is
reproduced, the system can graphically emphasize the words or syllables that are being
pronounced at each moment.

Utilization of the invention 

The preferred utilization of the invention would be done as follows. When starting the
system, the user has the option to use it in training mode or in exploratory mode. In training
mode, he/she would use predefined original extracts. In exploratory mode, he/she would use the
original extracts that exist in a language sample in which he/she might be interested. In both
cases, the result would be that the learner would sequentially choose one or more extracts.

The learner will generally work on the linked continuous syllabic blind extract,
because it is the one that best reflects the nature of aural utterances. The user will also be
able to display any of the eight existing blind extracts and work on them.

The computerized system of the preferred embodiment allows the user to select
certain components of the original extracts by working on the blind extract, in order to get
information about those components. This selection can be done also on the continuous blind
extracts, even though they might have syllables that are linked to other syllables. Even if there
are linked syllables, in the preferred embodiment there exist means for distinguishing the words
that belong to those syllables that are linked, as shown in Exhibit 6, which represents the
continuous extract of an original extract. In the Exhibit, the user has selected the first part of the
syllable "ris" and the invention has detected that it is the end of the word "bar" and has selected
it wholly.
Exhibit 6 

Separated continuous syllabic blind extract 



X X^ X X XX^ X XX 



Example original extract 



The man at the bar is eating a sandwich 



Original extract divided into syllables in continuous fashion 



The-ma- nat- the- ba-r is-e-atin-ga-sand-wich 



In order to perform the selection of a fragment, the user first selects one or more
characters of the blind extract, and the system identifies the parts of the original extract that
correspond to those characters. To do so, when the user selects a fragment of the
extract, the system would graphically emphasize the word and the phrase in which it is located,
and it could show the syllables that are involved in the selection. At this moment, the user can
choose between the following three possibilities:

1. one or several phrases,

2. one or several words,

3. one or several syllables.
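
The step in which the system identifies the parts of the original extract that correspond to the selected characters can be sketched as below. The sketch assumes a separated discrete blind extract in which the chains are separated by single spaces, uses the syllable division "ea-ting" and "sand-wich" of Exhibit 7, and does not model phrases; the function locate and its arguments are hypothetical names.

```python
def locate(words, syllables, char_index):
    """Map a character position in the separated blind extract
    to the (word, syllable) of the original extract it stands for."""
    pos = 0
    for word, sylls in zip(words, syllables):
        if pos <= char_index < pos + len(sylls):
            return word, sylls[char_index - pos]
        pos += len(sylls) + 1  # skip this chain and the following space
    return None  # the position is a separator or lies outside the extract


words = "The man at the bar is eating a sandwich".split()
syllables = [["the"], ["man"], ["at"], ["the"], ["bar"], ["is"],
             ["ea", "ting"], ["a"], ["sand", "wich"]]

# Blind extract shown to the user: "X X X X X X XX X XX"
selected = locate(words, syllables, 13)   # second "X" of the chain for "eating"
```

Returning both the word and the syllable lets the caller offer the word-level and syllable-level choices from a single selection.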



Once some components of the extract have been selected, the invention can offer the
user different types of information about those components.

1. One type of information can be other texts that contain the same types of components that
have been selected (syllables, words or phrases), which could be emphasized.

2. Another type of information is the aural reproduction of a fragment of the extract, which
can be the whole extract. The aural reproduction can also be performed for the examples
that have been mentioned in the previous lines.

3. Another type of information would be provided by generating aural reproductions of the
original extract or of a fragment of it, but applying certain characteristics that would
facilitate learner comprehension. These special characteristics can be achieved by
modifying the speed of the reproduction, either globally or locally, or by modifying the
tone, intensity and/or duration of the sound segments that exist in the fragment. This
reproduction can be performed using special recordings made previously, using
text-to-speech converters, or using both. The different types of reproduction might have
the following characteristics:

- Play back the fragment at a lower speed than normal.

- Play back the fragment aurally emphasizing the syllables, i.e., marking the different
syllables, in order to help the learner to identify the different syllables.

- Play back the fragment word by word, in order to facilitate the identification of the
different words.

- Play back the fragment aurally emphasizing certain parts of the fragment, which can be
segments, syllables, words or phrases.

- Play back the fragment amplifying the frequencies that are located around 2000 Hz, in
order to facilitate the distinction of the information that is located in that part of the
spectrum, which is especially important for discriminating fricative sounds [Borden et
al., p. 177].

4. Another type of information would be provided by graphically marking different parts of
the fragment, either in synchronization with the aural reproduction or independently. The
synchronized reproduction is especially useful in the case in which certain parts of the
fragments are being aurally emphasized.


When using the invention, it would be advisable to provide slow reproductions at the
beginning of the instruction and gradually increase the speed to assist the learner. It is also
important to include reproductions done at normal speed from the beginning, so that the learner
is exposed to authentic language.

The ideal moment to start using the invention in the case of foreign language learning
is the beginning of the instruction, but it can also be used for learners that have already started
the instruction, with the goal of eliminating those incorrect habits that they might have
acquired.

The invention can be used in all kinds of exercises and activities in which there is a
fragment of aural language that contains certain information for the learner. In this case, the
invention would allow the learner to listen to the information and review parts of it without
having to look at the written form, which results in training of listening skills.



DESCRIPTION OF OTHER EMBODIMENTS 

There are many alternative ways to build the invention, and some of them will be 
described in this section. 

In an alternative embodiment, the graphical means that are used for the blind extract
are not characters; it is possible to use, for example, a hyphen, as shown in Exhibit 7. It can be
seen that in one of the cells, the blind extract has been horizontally elongated in order to align it
with the original extract. It could also be possible to use a waveform diagram or any other type
of graphical means.

Exhibit 7 

Example original extract 

The man at the bar is eating a sandwich 

Blind Extract 

[blind extract drawn with hyphens; graphic not reproduced]

Blind Extract that Has Been Adapted to the Original Extract 



The man at the bar is ea-ting a sand-wich 



Original extract divided in syllables 


When the blind extract is created using graphical means which are not characters, the
different parts of the original extract could correspond to different fragments of the blind
extract. In another embodiment, the blind extracts use characters different from "X". In
general, it is advisable that the characters that are used exist neither in the target language
nor in the native language of the learner, in order to reduce the interference between written
form and sound form.

In another alternative embodiment, the different fragments of the separated blind
extract are separated by graphical means other than the spaces and orthographic
characters that are used in the preferred embodiment, so that they could be separated by special
characters or by geometric figures such as lines or other figures.

In another alternative embodiment, the separated blind extract could contain a chain
that would correspond to more than one word of the original extract; for example, the words "the
man" might be associated with a chain such as "XX".




In another alternative embodiment, the blind extract might contain some of the words
that exist in the original extract; for example, an original extract such as "the man lives in
Madrid" might generate a separated discrete syllabic blind extract such as "the man X X XX".
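
A minimal sketch of this embodiment follows, assuming that the words to keep visible are identified by their position in the extract and that the syllable counts come from a toy table. The function partial_blind_extract and its parameters are hypothetical names.

```python
def partial_blind_extract(words, syllable_counts, keep):
    """Separated discrete syllabic blind extract in which the words
    at the positions listed in `keep` are left in written form."""
    return " ".join(
        word if i in keep else "X" * syllable_counts[word.lower()]
        for i, word in enumerate(words)
    )


words = "the man lives in Madrid".split()
counts = {"the": 1, "man": 1, "lives": 1, "in": 1, "madrid": 2}  # toy values

extract = partial_blind_extract(words, counts, keep={0, 1})
# extract == "the man X X XX"
```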

In another alternative embodiment, the blind extract would not include the whole
original extract, or some parts of the blind extract would not correspond to any part of the
original extract.

In another alternative embodiment, the invention is built without a computerized
system, for example by combining a system that can reproduce sounds with a paper-based
object. In this alternative embodiment, the blind extract on paper might be a syllabic
extract and the audio reproductions might include some reproductions in which the syllables are
aurally emphasized.

Also, in a similar embodiment based on an audio system and a paper-based
object, some special fragments of the original extract might be marked graphically in the
blind extract, and might simultaneously be aurally emphasized as well.

In another possible embodiment, the invention is built without a computerized
system, but uses an electronic system such as a television. The television screen
would show the blind extracts and the audio of the television would create the aural
reproductions. As in the previous case, some fragments of the blind extracts that appear on the
screen might be graphically emphasized, and they can also be aurally emphasized in the aural
reproductions.

Another possible embodiment uses SEGMENTAL BLIND EXTRACTS, which show
the sound segments that make up the syllables. The segments are the sound units identified in
speech that make up the syllables. Using segmental extracts creates the following additional
types of extracts:
1. separated discrete segmental structured blind extract,

2. separated continuous segmental structured blind extract,

3. linked discrete segmental structured blind extract,

4. linked continuous segmental structured blind extract,

5. separated discrete segmental non structured blind extract,

6. separated continuous segmental non structured blind extract,

7. linked discrete segmental non structured blind extract,

8. linked continuous segmental non structured blind extract.
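
The eight names above are the combinations of three two-valued properties, which the following sketch enumerates in the same order as the list. The function name segmental_extract_types is hypothetical, and the code only reproduces the naming scheme; it does not generate the extracts themselves.

```python
from itertools import product


def segmental_extract_types():
    """Names of the eight segmental blind extracts, in list order."""
    names = []
    for structure, linkage, continuity in product(
        ("structured", "non structured"),
        ("separated", "linked"),
        ("discrete", "continuous"),
    ):
        names.append(
            f"{linkage} {continuity} segmental {structure} blind extract"
        )
    return names


types = segmental_extract_types()
```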



